A meta-indexing method for fast probably approximately correct nearest neighbor searches
نویسندگان
چکیده
Abstract In this paper we present an indexing method for probably approximately correct nearest neighbor queries in high dimensional spaces capable of improving the performance any index whose degrades with increased dimensionality query space. The basic idea is quite simple: use SVD to concentrate variance inter-element distance a lower space, Ξ. We do space and then “peek” forward from by gathering all elements less than $d_{\Xi }(1+\zeta \sigma _{\Xi }^{2})$ d Ξ ( 1 + ζ σ 2 ) , where d Ξ Ξ, $\sigma }^{2}$ data ζ parameter. All thus collected form tentative set T which scan using complete feature find point closest query. advantages are that (1) it can be built on top virtually (2) build model distribution error precise enough allow designing compromise between speed. show improvement obtain SUN base.
منابع مشابه
Probably Approximately Correct Learning
This paper surveys some recent theoretical results on the efficiency of machine learning algorithms. The main tool described is the notion of Probably Approximately Correct (PAC) 1 earning, introduced by Valiant. We define this learning model and then look at sorne of the results obtained in it. We then consider some criticisms of the PAC model and the extensions proposed to address these criti...
متن کاملProbably Approximately Correct Learning
Learning quickly when irrelevant attributes abound: a new linear-threshold algorithm. of empirical and explanation-based learning algo
متن کاملProbably Approximately Correct Search
We consider the problem of searching a document collection using a set of independent computers. That is, the computers do not cooperate with one another either (i) to acquire their local index of documents or (ii) during the retrieval of a document. During the acquisition phase, each computer is assumed to randomly sample a subset of the entire collection. During retrieval, the query is issued...
متن کاملFast indexing method for multidimensional nearest-neighbor search
This paper describes a snapshot of work in progress on the development of an eecient le-access method for similarity searching in high-dimensional vector spaces. This method has applications in, for example, image databases where images are accessed via high-dimensional feature vectors. The technique is based on using a collection of space-lling curves as an auxiliary indexing structure. Initia...
متن کاملProbably correct k-nearest neighbor search in high dimensions
A novel approach for k-nearest neighbor (k-NN) searching with Euclidean metric is described. It is well known that many sophisticated algorithms cannot beat the brute-force algorithm when the dimensionality is high. In this study, a probably correct approach, in which the correct set of k-nearest neighbors is obtained in high probability, is proposed for greatly reducing the searching time. We ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Multimedia Tools and Applications
سال: 2022
ISSN: ['1380-7501', '1573-7721']
DOI: https://doi.org/10.1007/s11042-022-12690-w